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Qs, ■ In this paper we develop the theory of clusterization of peaks in a Gaussian 

random field. We have obtained new mathematical results from this theory and 
the theory of percolation and have proposed a topological method of analysis of 
sky maps based on these results. We have simulated 10° x 10° sky maps of the 
cosmic microwave background anisotropy expected from different cosmological 
^ models with 0.5° — 1° resolution in order to demonstrate how this method can 

5— ( ■ be used for detection of non-Gaussian noise in the maps and detection of the 

Doppler-peak in the spectrum of perturbation of AT/T. 

PACS number(s) 98.80.Cq, 97.60.Lf, 98.70.Vc 

1 Introduction 

Observations of the cosmic microwave background (CMB) anisotropy are fundamental 
in understanding the formation of structures and the nature of the dark matter in the 
Universe. The distribution of the fluctuations of AT on angular scales larger than a 
few degrees characterizes fluctuations of the primordial density on scales larger than 
the acoustic horizon at the moment of the last scattering. In this paper we discuss 
the CMB fluctuations on the angular scale 6 ~ 1°. This angular size corresponds 
to the scale at recombination. The distribution of the CMB anisotropy on this scale 
preserves the information about the ionization history of the Universe and about the 
nature of the dark matter. Various scenarios of recombination and different amounts 
of baryons in different cosmological models lead to different properties of the anomaly 
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in the AT distribution. (Namely, the different height of the so-called Doppler-peak in 
the power spectrum C\ at the multipole number I m 220). (Efstathiou, Bond, White, 
1992; Coulson et al, 1994; Gorski, 1993; Scott et al., 1995). 

Several groups have reported on observational data on angular scales about 1° 
(Devlin et al., 1994; Clapp et al., 1994; Dragovan et al., 1994; De Bernardis et al., 
1994; Cheng et al., 1994; Schuster et al., 1993; Wollack et al., 1994). The interpre- 
tations of these experimental results and the comparison with the expected power 
spectrum of the AT/T fluctuations in different cosmological models are unclear. 

Several authors propose different methods to detect the peak-like anomaly in 
the angular distribution. These methods are based on correlation analysis (Hinshaw 
et al., 1995; Naselsky and I.Novikov 1993, J0rgensen et al., 1995). Recently, the 
first attempt to understand the topological properties of AT/T maps caused by the 
presence of the Doppler-peak, was made by Naselsky and D. Novikov (1995). Many 
important questions related to this problem were listed in the paper by Hinshaw, 
Bennett and Kogut (1994). Some of them are: 1) how to distinguish the effect 
caused by the presence of the Doppler-peak from the isolated point sources, and 2) 
what is the role of the cosmic variance in a statistical analysis of small regions of the 
sky. 

In this paper, we propose the topological method of analysis of the sky maps. 
This method is based on the assumption that the initial fluctuations are Gaussian. 
In this case, the background radiation will form a two-dimensional Gaussian random 
field. The statistical properties of a Gaussian random field were first investigated by 
Rice (1944,1945) for a one-dimensional field to analyze electrical noise in communica- 
tion devices. A.Doroshkevich (1970) was the first who applied this theory extensively 
in the study of formation of cosmic structures. J.Bardeen J.Bond, N. Kaiser and 
A.Szalay (1986) (referred to as BBKS in what follows) in their classical paper devel- 
oped the theory for a three-dimensional field and J.Bond and G. Efstathiou (1987) 
(referred to as BE in what follows) developed the theory for a two-dimensional field. 

In the work presented here, we develop the theory of clusterisation of peaks in 
a random Gaussian field, investigate the influence of the spectral parameters on the 
AT/T fluctuations for different cosmological models, propose methods of filtering of 
the sky maps, investigate the properties of the two-point correlation function for a 
small region of the sky, investigate the topological properties of the AT/T angular 
distribution caused by the presence of the Doppler-peak in the spectrum of fluctu- 
ations of AT/T, and propose the method of percolation and cluster analysis which 
allows us to detect the unresolved point sources (non-Gaussian noise which can closely 
imitate the presence of a Doppler-peak in the spectrum) and remove them from the 
observational data. 

Note, that percolation has become a popular term among astronomers and cos- 
mologists. It was first introduced in cosmology by Zel'dovich (1982) and has been 
succesfully applied during several years for the investigation of density distributions 
in the non-linear stage of gravitational instability. (See the review by Dominik and 
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Shandarin, 1992; Einasto et al., 1984; Klypin, 1985). The AT/T maps percolation 
as a first test of the presence of non-Gaussian noise was first discussed by Naselsky 
and D.Novikov (1995). 

The outline of this paper is as follows. In Section 2, we review some general 
properties of AT/T fluctuations and investigate the properties of the correlation 
function for different cosmological models, namely with and without a Doppler-peak. 
In Section 3, we introduce different kinds of filtering of the AT/T maps and discuss 
the influence of filtering on spectral parameters and the correlation function. In 
Section 4, we develop the properties of one-dimensional slices of a two-dimensional 
Gaussian random field and of a two-dimensional Gaussian random field itself and 
obtain what we consider to be the most important result of this paper: that the 
clusterisation of peaks in a Gaussian random field depends only on one spectral 
parameter 7 and is determined by the Gaussian nature of the field. In the same 
section we also propose a method for detecting "noise peaks". In Section 5, we 
propose a method of analysis of the sky maps. We summarize our main results in 
Section 6. Details of the derivations may be found in the appendices. 



2 The power spectrum and correlation function on 
intermediate scales 



In this paper we assume that the fluctuations in the cosmic microwave background 
(CMB) are the result of a random Gaussian process. This hypothesis has been 
adopted by many authors (see the review by B.E.Hinshaw et al., 1995) and may be 
argued for in the following way: since the fluctuations of the microwave background 
were imprinted in the linear regime, AT/T will be a linear combination of the initial 
perturbations amplitudes, and will, therefore, have the same probability distribution. 
Such a field with a zero mean is completely characterized by its two-point correlation 
function or by the equivalent power spectrum. 

Let us consider the distribution of the temperature of the cosmic microwave back- 
ground on the celestial sphere. As it was mentioned above, we assume that the field 
is a two-dimensional random Gaussian field on a sphere. This field is completely 
characterized by the power spectrum C\. Using this description one can write the 
following well known expression for the temperature of the relic radiation 



where q is the unit vector tangential to the direction of photon motion; a™ are 
independent random Gaussian numbers; (T(q)) is the average temperature of the 
relic radiation, so that (T(q)) = j- JT(q)dQ; Y™ are the spherical harmonics. 

We introduce the following expression for the anisotropy of the CMB: AT(q) = 
(T(g) — (T (q))) / (T (q)) . The two-point correlation function C{6) can be found by 
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averaging of AT(q) • AT(q') over the whole sky under the condition that the angle 
between the directions of q and q' is a constant: 



C obs {9) = (AT(q) ■ AT ((?)), q-q' = cos9. 
Taking into account Eq.(2.1) and that (a™a™') = 5u>5 mm > we get: 



(2.2) 



1 oo / 

CoU0) = -ri: E {aTfQP^e). 



(2.3) 



1=2 m=-l 



The average value of the observable correlation function is 



C{9) = C obs {9) = t-E( 2/ + 1)CiPi{cos9) . 



^ oo 



(2.4) 



1=2 



We begin the summation in Eq.(2.4) from I = 2. The term with I = 1 has to be 
removed before the calculation of the correlation function because the contribution 
to this term in the AT/T fluctuations cannot be separated from the motion of the 
observer relative to the background radiation. 

Note, that due to the presence of the Doppler-peak in the spectrum of perturba- 
tions of AT/T, the two-point correlation function for the CMBR anisotropy has a 
very characteristic feature on the angular scale of the last scattering, 9 ~ 9 U . This 
characteristic feature looks like a local antipeak on this scale. 

In practise the Doppler-peak in the spectrum of perturbations leads to an effective 
decrease of the correlation radius. On Fig.l we have plotted the two-point correlation 
function for three different models (Q b = 0.03, h = 0.5; Of, = 0.1, h — 0.5; and the 
model without a Doppler-peak) in order to illustrate their properties on a scale 9 ~ 1°. 

Let us consider the influence of a finite size of the considered region on the property 
of the observed correlation function. It is obvious, that the observed correlation 
function differs from its average ensemble approximately by the value of the dispersion 



The subscript on the left-hand side means that this value is obtained by averaging 
over the whole sky. The value D (9) is very small for 9 ~ 1°, but if we consider only 
a small part of the sky, then this value increases to: 



where f2 is the angular size of the considered region. On Fig. 2 we have plotted 
the correlation function for = 0.03, its dispersion and realizations of the random 
process for a part of the sky of 10° x 10° for the unfiltered and filtered cases presented 
on Maps la and lb. This result demonstrates the difficulty to detect the Doppler-peak 
using only small regions of the sky. 



D (9) = ClM - C obs 



' 2 =^) 2 T,(2l + l)C^(cos9) 



(2.5) 




(2.6) 
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Note, that this difficulty is caused only by "cosmic variance" . If the presence of 
the noise is taken into account, then this difficulty becomes even worse (see Hinshaw 
et al., 1995). Further we are interested in detection of the peak by using not only 
correlation analysis of the observational data, but also by topological analysis of the 
sky maps. For the further investigations we introduce the filtering of sky maps. 



3 Filtering of the sky maps 

a. Filtered correlation functions 

If we only consider a small part of the sky, then the geometry is approximately 
flat, and we can introduce the Dekart coordinates on a small area of a unit radius 
sphere (9 x 9, 9 <C n) and describe AT(x,y) as a sum of the Fourier series (B.E.): 

AT(x,y) = £ayC*(fc) cos (2tt^±^ + ^ , (3.7) 

where C{k) is the power spectrum; k = ^-\/i 2 + j 2 ; dij are independent random 
Gaussian values; <fij are random phases homogeneously distributed in the interval 
(0, 2n); L ps 9 is the size of the investigated area. 

Note, that Eq.(3.7) takes into account only modes smaller than L and cannot be 
applied for simulation of the map L x L. Nevertheless, after simulation of the map 
L x L by (3.7), it is possible to use only the part of this map I x I, where I <C L since 
the simulated distribution of AT in this part contains all modes and can be used for 
imitation of the AT distribution on the celestial sphere. 

The correlation function C b s (r) = (AT(x,y) ■ AT(x',y')} can be found by aver- 
aging over the square L x L similar to Eq.(2.2). 

C obs (r)= 1 -J2 a l C ( k ) J °(kr), (3.8) 

1 ij 



where r = y (x — x') 2 + (y — y') 2 . After averaging over the ensemble, we have 

C(r) = C~Jr) = W C(k)J (kr). (3.9) 

where k was given above as function of i and j. Eq.(3.9) is in good agreement with 
Eq.(2.4) because if 9 < vr and / > 1 then Pi(cos9) w J (W), and 19 kr. 

Thus, the fluctuations of AT can be described by Eq.(3.7), where C(k) ~ Ci, 
k ~ V£n (£n is the present horizon), and Q are the well known coefficients of the 
correlation function decomposition to series of Legendre polinomials, for different 
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models of the Universe. It is obvious that the observational correlation function 
differs from its average over the ensemble by the value of the dispersion: 

D(r) = ClJrj - (C^Jrj) 2 = Wc 2 (k)J 2 (kr) . (3.10) 

For (r <C L) the correlation function and its dispersion can be written as follows: 
C(r) = tx J kC{k)J {kr)dk 

D(r)=7rJkC 2 (k)J 2 (kr)dk (3.11) 
C obs (r) ~ C(r) ± y/S(rj. 

Note that for investigation of scales as the size of the recombination horizon, 
R = Rlsi the size of area has to be 9 LS <C I ~ <^ tt, &ls — 4^ 

For the further discussion we introduce smoothing of the map by a Gaussian filter: 

H (S, r) = -^e-^- r 2 = {x> - xf + {y> - yf . (3.12) 

Then the filtered field AT(x,y) is calculated from the initial, i.e., observable field 
AT(x,y)by. 

AT(x,y) = — J ^ J ^ AT{x\y')e-^dx'dy' . (3.13) 

Formally, the limits of integration in Eq.(3.13) are (—1/2,1/2), but, if 5 L, we 
can use ±oo instead of (—1/2,1/2) as the limits of integration. Thus, the correlation 
function for the filtered map is: 

C(r, 5) = (AT(x, y) ■ AT(x', y')) = tx \ kC(k)e' — J (kr)dk . (3.14) 

Jo 

Another kind of filtering is the difference between two Gaussian filters: 

H*(5 m i n , S max , r) = H (5 m i n , r) — H (S max , r) . (3.15) 

This filter removes the influence of the long modes. The spectrum of multipoles Ci 
for many popular models has a specific shape: 

Cl ~ ((ITT) ' < 3 - 16 ' 

Therefore, it is convenient to introduce a filter H 2 (5,r) which has the following char- 
acteristic feature: the spectrum of the filtered map, C(k), can be obtained from the 
spectrum of unfiltered map C(k) by: 



C(k) = k 2 C(k)e~— . (3.17) 
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If C(k) = p (spectrum without Doppler-peak), then from Eq.(3.14) and Eq.(3.17) 
we can easily see, that the correlation function for the filtered map has an especially 
simple (Gaussian) form: 

The deviation from the Gaussian in the observed correlation function for a filtered 
map can be caused either by presence of the Doppler-peak or by "cosmic variance" 
and noise. On Fig.l(a,b,c) we have plotted filtered correlation functions with different 
kinds of filters. This filter also improves the statistical properties of the map, which 
is convenient for the further cluster analysis. On the Map.la,b we have shown the 
AT distribution after filtering and unfiltered, respectively. 

Now let us consider, how the filtering influences the spectral parameters. 

b. Spectral parameters 

Using, Eq.(3.11), we can introduce the spectral parameters similarly to (BBKS): 



al = njkC{k)dk 
a\ = nJk 3 C(k)dk 
a \ = nJk 5 C(k)dk 

R* = a , 7 = — 



(3.19) 



These parameters are completely defined by the value of correlation function and its 
second and fourth derivatives at the zero. 

In the next section we will demonstrate, that the value of 7 determines the ge- 
ometry of the AT distribution seen on the map. In Fig.3a,b we have plotted the 
dependence of 7 on the value of resolution for different kinds of filter. From this 
figure we can easily see that the parameter 7, for the model with a Doppler-peak, 
is different from 7 for the model without a peak for an appropriate value of S (for 
a resolution angle which corresponds to I ~ 150). From Fig.l we also see that the 
differences between the correlation functions for the different models is particularly 
evident for this specific value of 5. We name this value as the resonance filter value, 
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4 Clusterisation of peaks in a random Gaussian 
field 

A Gaussian random field is the field for which a joint Gaussian probability distribution 
for random variables Xi is: 

P(xi, ....,x n )dxi....dx n = — — r — -—7-dxi...dx n , 

K ' ' n> ((27r) n detM)V2 
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2Q = J2^(M- 1 ) t] Ax J . (4.20) 

ij 

Only the means of the random variables and their variances are required to 
specify completely the covariance matrix M and the distribution 

Mij = (AxiAxj), Axi = Xi- ( Xi ). (4.21) 

4.1 One-dimensional cross-section of the two-dimensional field 

The problem of the clusterisation of peaks is especially easy in one-dimension. We 
demonstrate it in order to illustrate the properties of clusters of maxima in one 
dimension. Note, that results of this section can be applied to the analysis of one- 
dimensional observation. We now introduce the definition of a cluster of maxima of 
AT(x). Let us consider two points, X\ and x 2 , {x\ < x 2 ) and a threshold level v t 
(where v — ^), such that AT(x) > u t for all x in the interval [x-y \ x 2 \. We call these 
two points connected to each other with respect to the threshold u t . A collection of k 
maxima (of AT(x)) located at the points x±, x 2 , x^ is called a cluster of k maxima 
if all the points x±, x 2 , x^ are connected to each other with respect to the threshold 
level v t : 

AT(x) > u t , xi < x < x k . (4.22) 
In what follows, we will estimate the probability of existence of such a configuration. 

a. One-dimensional distribution 

If we consider a one-dimensional slice of the two-dimensional field, then the dis- 
tribution of AT(x) can be written using Eq.(3.7) 

2tt. \ 2tt 



AT(x) = J2 a ijy C ( k ) cos \i ix + ^y'J k = —\ji 2 + p . 
The two-point correlation function in this case is: 

C(r) = (AT(x)AT(x')) = tt J kC{k)J {kr)dk, r=\x-x'\, 

similar to the two-dimensional case. Next, we introduce new variables: 

, , AT(x) , . AT'ix) , AT"(x) fA . 

v(x) = ^; v(x) = K -L- i(x) = K -L. (4.23) 

cr a i a 2 

b. Conditional probability 
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We shall estimate the probability that two given maxima at the points x\ and 
x 2 are connected together. A way to solve this problem is to find the total num of 
minima above some level u t , n^ in (r), and the total number of all minima, n m i n (r), 
between these two points. The ratio of these two values is an estimate of the desired 
probability. 

We introduce the following notation. The event that the minimum with value 
above v t is located at the point x is called the event A. The event that the maximum 
with value above v t is located at the point X\ is called the event B. The event that 
such a maximum is located at the point x 2 is called the event C. Then the conditional 
probability that minimum at the point x between two given maxima is above some 
threshold v t can be calculated via the Bayes formula: 

The obvious joint probabilities 

P(A; B; C) = (Ax) 3 J |66£|^K ?7i = 0, 6, u 2 , V2 = 0, 6, u, 77 = 0, X 
x dv 1 dv 2 dvdi 1 di2<ii- l (4-25) 
P(B;C) = (Ax) 2 J 1661^(^1,^1 = 0,6,^2,772 = 0,6)^1^2^6^2- 

The limits of the integration in Eq.(4.25) are 

1) for ui, h> 2 , v - from v t to +00, 

2) for 6, 6 ~ from —00 to 0, 

3) for £ - from to +00. 

The conditional probability is the ratio of these two integrals: 

where N^ nin (x)Ax is the differential density of minima above u t (here and below x 
and r are in the units of R*). The constraint that v(x) is a minimum of any arbitrary 
value leads to identical equations except that the integration dv is over all v from 
— 00 to +00. 

The number of minima n^^r) and the total number of minima n m i n (r) of arbi- 
trary value between x\ and x 2 can be found by substituting Ax in (4.21) by dx and 
integrating from x\ to x 2 - 

^min( r ) = / N+ m (x)dx, 

n min (r) = / N min (x)dx, (4.27) 
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where N min (x) is the differential density of minima of an arbitrary value at the point 
x. The probability that any given minimum (from this collection of minima) has a 
value above v t is, therefore, N^ in (r)/N min . The probability that two given maxima 
(at points x\ and x 2 ) are connected together is equal to the probability that all 
n m in(r) minima between them are above u t . All points of considered minima can be 
labeled from x™ m and x™ r ™ and the desired probability is 



nun \ 



AT+ („min\ Af+ . ( x 

P{xi < — y X2 ) = . ™" n{r \ . (4.28) 

Here, P{x\ < > x 2 ) is the probability that v(x) > u t , for x\ < x < x 2 (i.e. the 

probability that x\ and x 2 are connected). In order to estimate this value, we sub- 
stitute n^ in (r)/r and n min (r)/r by N^ in (xi) and N min (xi) respectively and Eq.(4.28) 
simplifies to: 

^ - *> " {dtPj) • (4 - 29) 

Note, that we always have n min (r) > 1 and n^ in (r) / n min (r) < 1. If we consider two 
maxima which are spaced at r ^> 1, then P(A; B; C) = P{A) ■ P(B) ■ P{C), and 
P(B; C) = P(B) ■ P(C) because events A, B, C become independent. In this case 
we have: 

N m i n { x ) Npiin COnst, 

N m i n \X) N m i n COnst, i a on\ 

n +. ( r ) = iV + -r 1 ' 

' "mm V J mm i 

where N^ in and N min are the number of minima above v t and the number of minima 
of arbitrary height respectively, without the condition that there are maxima at X\ 
and x 2 . Combining Eqs.(4.29) and (4.30) we get 

tP • (4-31) 

If r < 1, then most probably only one minimum is located between two maxima 
(n min (r) = 1) and 

P{xi «_ X2 ) « 7V+ m ( r /2) . (4.32) 

c. Shape around two maxima 

The expected value of the field around two maxima at the point X\ and x 2 can be 
obtained from the covariance matrix of 7 x 7 variables: v\ 2 — v{x\p)\ 771,2 — v{ x i,2)', 
£1,2 — £(^1,2) an d v{x). The derivation is given in detail in Appendix A2. The steps 



10 



are as follows: 

1) The conditional probability that the field in point x falls in the range from v to 
v + dv on condition that ^(^1,2) = ^1,2, ^(^1,2) = 0, £(^1,2) = £1,2, is 

K 1 J PHX1,2) = 1/1,2, 77(^1,2) = 0, i(x la ) = ^1,2) ^ ' 

where events B and C are the conditions that there are maxima at the points x± and 
X2 respectively. 

2) Multiplication of the right hand side of Eq.(4.33) by v and integration over v 
gives the expected value of the field at an arbitrary point x, (u(x)). Multiplication of 
the right hand side of Eq.(4.33) by u 2 and integration over v results in the variance 

M*) 2 >. 

The result (shown in Fig. 4) can be applied to the local analysis of one-dimensional 
data, since we can estimate the probability for an appropriate pair of extrema to be 
Gaussian or not. 

d. Two-point peak-peak correlation function 

If a maximum with the height above v t is located at the point r = 0, then the 
differential density of maxima N+ ax (r) above this threshold at the distance r from 
the given maximum can be calculated using the technique of subsection b: 

This density has to be compared with N+ ax , i.e. with the density of maxima above 
u t without the condition that there is a maximum at r = 0: 

K ax = J |£|PM = 0,0<W£- (4.35) 
Then, the two-point peak-peak correlation function is: 

V p _ p (r) = N ^tL~ N+ma \ (4.36) 

max 

The two-point peak-antipeak correlation function can be obtained similarly to Eq.(4.36): 

M+. ( r ) _ M+ . 

y p ^ ap (r) = . (4.37) 

e. Clusters of peaks 
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The problem of clusterisation of peaks has a few difficult points. Calculation of 
the probability of appearance of a cluster of length k is quite difficult when peak-peak 
and peak-antipeak correlations are taking into account. However the calculation of 
the mean length of a cluster for an appropriate level v t is much easier and can be 
done analytically. 

Let us consider a maximum above the threshold u t at the point x = x*, and 
calculate the total numbers of maxima and minima above v t in the vicinity of this 
peak for x from x» - r to x„ + r: 



x*+r 



N+ ax / (%- P (x) + l)dx, (4.38) 



Xs,— r 



. rx*+r 

n min ( r ) = N Ln / (*p-a P (aO + l)dx 



In the vicinity of this maximum clusters of the different length k appear with the 
appropriate probability, which is difficult to determine. However, the total number 
of clusters in this vicinity is given exactly by 

E = O) - <in( r ) + 1 » ( 4 - 39 ) 

k 

where N k is the number of clusters of the length k. Obviously, the summation of the 
number of maxima in all clusters gives the total number of maxima above u t from 
x„ - r to i» + r: 

£^ = 0) + l- (4-40) 

k 

The unit in the right hand side of Eqs.(4.39) and (4.40) accounts for one additional 
maximum at x = x*. Therefore, the mean length of a cluster k(r), is: 

T(r) = Z k h-N k (r) = n+ ax (r) + l 
1 } E k N k (r) n+ ax (r)-n+ m (r) + l 1 " > 

Note, that if n+ ax (r) = n+ in (r) (with n+ ax (r) > n+ in (r) always being true), then 
only one cluster of the length n+ aa .(r) + 1 appears. We are interested in the mean 
value k for the whole considered region [0;L]. This value can be found by using 
Eq.(4.38) and Eq.(4.41) and averaging over all maxima, and letting the integration 
in Eq.(4.38) go from to L. If the statistical properties of the region are "good 
enough" (namely r c L), then 

nLx(r) = N+ ax J\%- P (x) + l)dx » N+ ax L » 1, (4.42) 
n mm (r) = K m [ L (V P - ap (x) + l)dx w N+ in L » 1, 

Jo 
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and 



N + 1 

k = ^ = -, , (4.43) 

max mm v 1 ' ' > 

where a(u t , 7) = A+ iri /A+ a;r . The values N+ ax (v t ) and iV+ in (i/ t ) can be found ana- 
lytically, see Rice (1944, 1945): 

A^= 7= w = <W leie" 2 "^)^, (4.44) 

'(27r) 3 (l - 7 2 ) °i ^ - 7 - 00 



^(2tt) 3 (1 - 7 2 ) ^1 ^ 



,,2 



= n = , ~ r du I \t\e~ 2 df. (4.45) 



The result of this integration is presented in Appendix Al. The integration over 
in Eq.(4.44) gives the differential density of maxima and minima which are plotted 
in Fig. 5a for different values of 7. From Eq.(4.43) we can see that the mean length of 
the clusters depends only on the density of minima and maxima above the threshold 
v t . For a large value of u t , k — > 1 because, as we see from Fig. 5a, almost all the high 
extrema are maxima and a< 1. For a large negative value of v t k — > 00, because, 
in this case, N+ ax (v t ) -> N+ in {y t ) and a -> 1. 

The dependence of k on the level of slice v t for different 7 are plotted in Fig. 5b. 
A low value of 7 corresponds to the high rate of clusterisation. Note, that 7 lies 
in the interval < 7 < 1. The value 7 = 1 corresponds to the power spectrum 
P(k) = const ■ 8(k — k ). In this case we have k — 1 for — \u \ < u t < \u \; k = 00 for 
u t < — \vq\\ k = for u t > \u \. Here, u is the random Gaussian value and depends 
on the appropriate realization. 



4.2 Two-dimensional field 

In this section, we develop the theory of the statistical properties of the two-dimensional 
field. These properties can be applied to further analysis of the sky maps. In what fol- 
lows we obtain an important characteristic statistical feature of the two-dimensional 
Gaussian field: clusterisation of peaks depends only on one spectral parameter 7, 
just as in the one-dimensional case. This spectral parameter of a Gaussian field of 
CMB radiation depends effectively only on the model of recombination. Note, that 
this natural feature of the Gaussian field can be used not only for determination of 
the ionization history, but also as a test of the presence of noise in a map (which, 
most probably, is non- Gaussian). We derive equations for the field in the vicinity of 
two neighbouring maxima. These equations define the details of the structure of the 
cluster in the vicinity of the maxima and can be used for the local analysis of the 
appropriate part of the map. 

a. Surface AT(x,y) 
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A two-dimensional field can be described as a two-dimensional surface in a three- 
dimensional space, AT(x,y), Fig. 6. We will cut this surface at the different levels 
v t . The surface intersects the plane AT(x,y) = v t along the lines of level v t (see 
Map.2a,b). If, for example, two local maxima with the values above this threshold v t 
are confined by a closed line of level v t , then these maxima are considered connected 
together in one cluster. It is obvious that two neighbouring maxima on this surface 
can be connected together only through the saddle point between them, see Map. 2. 
Therefore, if the saddle point between two maxima is above the threshold u t , then 
these maxima are connected together. Let us define a cluster of the length k similar 
to the one-dimensional case. 

Definition: Cluster of the length k is a collection of k maxima confined by a 
closed level-line. 

In what follows we describe the clusterisation of maxima, namely, we find the 
mean length of the clusters as a function of the level of the slice, v t , and spectral 
parameter, 7. 



b. Extremal points 



Let us introduce the following expressions: 

= AT. = All. = AT y . 

?11 - §22 - — — , §12 - -r— • 



If rji = r/2 = at the point (xo,yo), then this point is an extremum, and the field 
v in the vicinity of (x ,y ) is 

v(x, y) = v(x , y ) + iux 1 + 2£ 12 xy + i 22 y 2 . (4.47) 

We rotate the x, y axes by the angle (p = \arctg around the point (xq, yo) and 

find the eigen values of the second derivatives matrix £, Ai and A2: 

^11 = Ai cos 2 ip + A2 sin 2 ip, 

^22 = Ai sin 2 ip + A 2 cos 2 (p, (4.48) 
62 = (Ai - A 2 ) sin (p cos (p. 

All extrema can be divided in three types: maxima Ai^ < 0, saddle points (Ai > 
0, A 2 < 0) or (Ai < 0, A 2 > 0), and minima Ai j2 > 0. 



c. Density of maxima, minima and saddle points. 



Derivation of the density of saddle points is given in Appendix Bl. (The density 
of maxima and minima for the two-dimensional Gaussian field was obtained by B.E., 
1987). Here we present the results of our analytical calculations for the saddle points 
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and mention the results of calculations by B.E. for the maxima and minima. The 
differential density of the saddle points is: 



N sad (v,~i)dv = 



g 2(3-2^)^ _ 



(4.49) 



The integration of Eq.(4.49) from v t to infinity gives the number of saddle points 
above the same threshold v t 



n sa d{vt) 



N sad (v)dis = —^-jA 



V 2 ( 3 - 2 7 2 ). 



(4.50) 



where $(x) is the probability integral by Gradshteyn and Ryzhik, 1980. The full 
density of saddle points is: 



n sad (-oo,j) 



1 a\ 



The differential density of maxima is: 



(4.51) 



N max (v)dv 



V2 a\ 



iTT^aj I 4^3 - 2i 2 



_g 2(3-2 7 2) 



1 + $ 



7*/ 



7(2-2 7 2)(3-2 7 2 



7 (z/ 2 - 1) ,2 
+ — — ; -e'~ 



1 + $ 




7za/1 — 7 2 ^^5- 

+ , — e 2 ( 1 -^ 2 ) 



(4.52) 



Note, that N min (v) = N max (-v) and 



7l T , 



; (^,7) 



N max (u,j)du 



n r 



'-oo,7) 



87iV3o"i 



(4.53) 



The distribution of maxima, minima and saddle points are shown in Fig. 7a for differ- 
ent values of v. As it can be seen from this figure, this distribution is very sensitive 
to the value of 7. In what follows, we demonstrate the importance of sensitivity of 
Nsadiy-, I)-, N max (i>, 7) and iV m j n (z/, 7) to the value of 7. Namely, the rate of clusterisa- 
tion of peaks in a random two-dimensional Gaussian field depends only on the value of 
7. Note that the corresponding behaviour of N max (u, 7) and N min (u, 7) was discussed 
earlier in Section 3.1 for the one-dimensional cross-section. In the one-dimensional 
case, the rate of clusterisation (in a random Gaussian field) also depends only on the 
parameter 7 as we demonstrated. 



d. Clusters of peaks 
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Let us consider an appropriate realization of the two-dimensional random Gaus- 
sian process in a map I x I, see Map. 2. In this figure the maxima and the saddle 
points with values larger than the level of the slice, v t , together with the isolines 
v — Vt are shown. If the value of v t is high, all maxima are separated and only 
clusters of the length k = 1 are observed. Reduction of the level v t leads to the 
appearance of big clusters (maxima begin to connect together and generate clusters). 
Let us consider a slice of the map and calculate the number of clusters N k of the 
length k for k = 1, 2, .... Summation of the number of maxima in all clusters of the 
map gives the total number of maxima: 



oo 

E 

k=l 



N k -k = n max (v t ). (4.54) 



Now, let us calculate the total number of clusters (having arbitrary number of max- 
ima). Each cluster can contain maxima, minima and saddle points. The numbers 
of maxima, minima and saddle points in one cluster are not independent values. If 
one particular cluster contains k = k max maxima and k m i n minima, then it has to 
contain k saa > = k max + k min — 1 saddle points. Therefore, the total number of clusters 
of arbitrary length k for a given slice - level v t is: 

N k = n max (v t ) + n min (v t ) - n sad (v t ) . (4.55) 

k 



Using Eqs.(4.54) and ( [4.55 ), one can write: 



X/ Nfc Umax Urnin Tlsad 



(4.56) 



If only high levels of v t are considered, then the number of minima, n m i n (v t ), is small 
for 7 sufficiently different from zero, (see Fig. 7a) and can be disregarded. 

In this case, two neighbouring maxima are counted in one cluster for the level 
Vt-, only if the saddle point between them is above this level. Therefore, each cluster 
containing k maxima, also contains k — 1 saddle points (clusters of the length 1 does 
not contain any saddle point). In this case the mean length of the clusters is 

k& 1 + ot{y u i), (4.57) 

where 

a(yul) = - f r, (4.58) 

similar to the one-dimensional case. The ratio n sa( i/n max depends not only on the 
level of u t , but also on spectral parameter 7. Thus, a = a(i/ t , 7) also depends on both 
Vt and 7. The dependence of k on the level v t in the general case Eq.(4.57) is shown 
in Fig. 7b, for different values of 7. For low values of 7 the clusterisation occurs faster 
than for high values of 7. This can be concluded from Fig. 7a, where the distributions 
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of extrema for different values of 7 are shown. Therefore, clusterisation of peaks in a 
two-dimensional random Gaussian field (namely the mean length of clusters) can be 
described in terms of only one parameter, 7. 

For a high level of the slice, almost all extrema are maxima, and n m i n — > and 
n sa d — > (k — > 1 and we have separated maxima). For u t — > 0, n max + n min — n sa d 
tends to zero (see Appendix Bl), and k — > 00. Infinite k is equivalent to the existence 
of an infinite cluster of maxima. However, the finite size of the region considered 
prevents the realization of an infinite cluster (a finite region has only a finite number of 
maxima). In an infinite region, the existence of an infinite cluster and the percolation 
are essentially equivalent. Since percolation is also defined for a finite region, the 
concept of an infinite cluster in a finite region can be addressed in terms of percolation. 

e. Percolation 

Let us colour the map of the considered region in two colours: black and white. 
The black area is for v > v u and the white area is for v < v t (see Map. 3). Percolation 
through black (white) zones is usually defined as the possibility to "walk" from one 
side of the map to another (from the lower to the upper border and from the left to 
the right side) only through the black (white) zones. Roughly speaking, percolation 
means the existence of a black (white) cluster with the linear size of the considered 
map, or the existence of an infinite cluster in the case of an infinite map. In terms 
of extremal points, percolation over black (white) zones means the existence of an 
infinite cluster of maxima (minima). The change of percolation over black zones 
into percolation over white zones has to take place at the v v = in an infinite map. 
Gaussian nature of the distribution of the CMB anisotropy assures this property. In a 
particular realization of the Gaussian process in a finite map a change of percolation 
regimes arises when v is slightly different from zero, because of the statistical character 
of the process. The difference of v from zero can be evaluates as follows: 

1. Each hotspot (black zone) and coldspot (white zone) can be considered as an 
approximately independent realization (correlations between regions of the size r c x r c 
is negligible). 

2. The total number of such approximately independent realizations is iV ~ L 2 jr 2 . 

3. The variance is D ~ l/y/N ~ r c /L. 

Therefore, for the appropriate region of the sky the probability of AT fluctuations 
to be Gaussian can be estimated. This can be accomplished using the percolation 
technique: if u p ^> r c /L then, most probably, the additional non-Gaussian noise is 
present in this map. 

Note, that this obvious test can be considered only as the first step of the detection 
of the noise, because the critical point of percolation v v can be equal to zero even 
with non-Gaussian noise (for example, if the noise is symmetrical relatively to v — 0). 

f. Shape around two maxima 
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The expected value of the field around two given maxima in the two-dimensional 
case can be derived similarly to one-dimensional case (Section 4.1). The derivation 
is given in Appendix B2. This result describes the clusterisation of two neighbouring 
maxima and can be applied to the local analysis of small patches of the map, which 
contain only the area of two neighbouring hotspots. 

This area can be checked for presence of noise by evaluating the probability that 
these peaks have a Gaussian nature, as it is explained in the following. As it was men- 
tioned (e.g. by B.E.) the contour curves in the neighbourhood of Gaussian peaks are 
ellipses. The directions of major and minor axes are different for different maxima. 
Thus, two neighbouring maxima have some relative orientation. Suppose that, in an 
appropriate part of the map, these maxima are located at the distance r from each 
other, and that the value of the field and its second derivatives (relative orientations) 
are vi, £{ 2 , £22 anc ^ v 2i £11 > £125 £22 f° r ^ ne ^ TS ^ an d the second maximum respec- 
tively. Then it is possible to predict the mean value of the field and its variance in 
the vicinity of given maxima and to compare this result to observed pairs of hotspots 
in the map. 

The mean value of the field around two maxima is presented in Fig. 8, for different 
kinds of spectra (i.e., for the different values of the spectral parameter 7). Note, that 
this value depends on the behaviour of the correlation function on the scale of this area 
> r c mainly being determined by 7. For low values of 7 two neighbouring hotspots 
stick together faster than for high values of 7. This fact is in good agreement with 
the results of subsection d. The increase of the rate of clusterisation for decreasing 
values of 7 is favorable to faster production of big clusters. 

5 How can we analyse the sky maps? 

In this section we propose a method of analyzing of the observational data on CMB 
anisotropy for one- and two-dimensional experiments. Interpretation of observations 
of a specific patch of sky is complicated for the following reasons: 

1. Most of the cosmological models predict the CMB anisotropy to be a single real- 
ization of a random Gaussian process for which the properties can be predicted only 
for an ensemble average of such realizations. Any single realization differs from its 
ensemble average approximately by the value of the dispersion (the so-called "cosmic 
variance" ) . 

2. Presence of a non-Gaussian noise can be interpreted as fluctuations of CMB itself 
and can give a wrong contribution to the correlation function or, equivalently, to the 
power spectrum. 

As for the first reason, the following comments can be made. A small part of 
the sky may not be representative. This may lead to misinterpretation of properties 
of the two-point correlation function of the CMB as obtained from one patch only. 
Thus, the finite nature of the considered region may lead to misinterpretation of the 
correlation function of e.g. the galaxy distribution on scales comparable to the size of 
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the region. This effect causes difficulties in detection of the so-called "long distance 
correlation" in the galaxy distribution which can appear in cosmological models with 
non-standard inflation (Starobinsky, 1992). 

As for the second reason, the correlation function obtained by measuring the sum 
of the relic radiation itself and non-Gaussian noise will of course be the correlation 
function of the sum of the noise and relic radiation. 

Note, that both of these two reasons may lead to a correlation function which 
closely can imitate the presence of a Doppler-peak in the spectrum of perturbations of 
CMB. In this chapter, we propose the topological method of analysis of the sky maps 
which is useful for (1) detection of non-Gaussian noise, (2) detection of a Doppler-peak 
in the spectrum. 

a. One dimensional experiment 

Assume that we have data from a one-dimensional experiment being a one- 
dimensional slice of the two-dimensional realization (Fig.4, Section 4.1). If the pri- 
mordial signal is the result of perturbations of the CMB radiation only, then the 
correlation analysis is equivalent to the topological analysis because all statistical 
properties of the Gaussian field are completely characterized by its power spectrum. 
In this case, we should only estimate the dispersion of the correlation function (sec- 
tion 2) on the most interesting scales (about 15' to 1°), which is determined by the 
size of the considered region (Hinshaw et al., 1995). This is necessary for the estima- 
tion of the probability that the peak-like anomaly (if it is detected in the correlation 
function of the observed data) occurs due to the presence of the Doppler-peak in the 
spectrum, and not due to "bad" statistical properties of the selected region. 

Next, the rate of the clusterisation of peaks (Section 4.1) for different values of 
the filter (Section 3) can be checked. It is obvious that the clusterisation of minima 
is equivalent to the clusterisation of maxima if the sign of fluctuations is changed. 
This equivalence exists due to the symmetrical statistical properties of the Gaussian 
field with respect to zero. Variation of the value of the filter leads to a change of 
the spectral parameter 7. For a given parameter 7, the rate of clusterisation of 
peaks is determined only by the Gaussian nature of the field; this rate is plotted on 
(Fig. 5b), for different values of 7. Note, that for different cosmological models, the 
dependence on 7(r ) (where r is the resolution) is different (Fig.3a,b). If, for each 
value of 7 (which corresponds to an appropriate value of ro), the rate of clusterisation 
corresponds to the Gaussian statistic (Fig.5b), then this indicates a Gaussian random 
field and the observed correlation function is the correlation function of CMB. 

Now let us discuss how the situation will change in case of non-Gaussian noise 
being present. In this case, the correlation and the topological analysis are not equiv- 
alent to each other. The presence of the noise shows up as presence of "noise peaks" in 
the random field. These "noise peaks" give a contribution to the observed two-point 
correlation function. If the observed correlation function is not analysed properly, the 
presence of the "noise peaks" in the observed data may lead to wrong conclusions. 



19 



In order to avoid a wrong interpretation of the observed data, we propose to perform 
the cluster analysis: 

1) The rate of clusterisation (as discussed above) should be checked. If this test shows 
us that the rate does not correspond to one spectral parameter 7, then additional 
random noise is present in the data. 

2) A search for "noise peaks" should be performed. 

If these tests indicate the presence of noise, then how can we remove the "noise 
peaks"? We propose to check the probability to be Gaussian for each pair of neigh- 
bouring extrema, by using the technique of Section 4.2, Fig.4. The peaks with small 
probability should not be considered in the further analysis. 

b. Two-dimensional data 

Difficulties in the interpretation of the two-dimensional data are similar to that 
in one-dimensional case, but the statistical properties are usually much better. The 
analysis can be analogous to that for the one-dimensional case, and the steps are as 
follows: 

1) Filtering of the sky map, see Section 3. This step can be useful for the following 
reasons. The use of the specific (for example non-Gaussian) filtering removes the 
influence of long modes and, therefore, leads to a decrease of the correlation radius. 
Since we are interested only in the relatively high modes, such a filtering improves the 
statistical properties of the map (for the purpose of the cluster analysis). A filtered 
map contains a larger number of peaks and is more representative than the unfiltered 
one. Thus we perform the cluster analysis on the filtered map. 

2) Cluster analysis. This step consists of three substeps: 

a. Percolation as the first test of the presence of non-Gaussian noise (see Section 4.2, 
Map.3). 

b. Rate of clusterisation. This substep can be performed similarly to the one- 
dimensional case, but using the results of Section 4.2, Fig. 7b. 

c. Detection of the "noise peaks" (if the previous test indicates the presence of noise), 
as for the one-dimensional case, but using the results of Section 4.2, Fig. 8. 

c. Topology for different models 

As it was shown above, the most important topological properties (namely cluster- 
isation of peaks in a random Gaussian field) can be explained in terms of only one 
spectral parameter 7. This spectral parameter is very sensitive to the presence of 
Doppler-peaks in the spectrum of fluctuations of AT for the resonance filter (Section 

3) . It means that the rate of clusterisation will change from one model to the next, 
for the same value of the filter. A big value of 7 corresponds to a low rate of clus- 
terisation (Fig. 7b) and, for the same level of the slice, the expected number of big 
clusters is less than for a small value of 7. For the model with a pronounced Doppler- 
peak, the value of 7 is larger than it is for the model without a Doppler-peak in the 
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spectrum (Fig.3a,b). Therefore, for such models, the clusterisation happens more 
slowly than for the model without a Doppler-peak (Map.4a,b). Thus, the number of 
big clusters for the model without a Doppler-peak is larger than for the model with 
a Doppler-peak. 

Another obvious difference consists in that a number of maxima (minima) per 
unit square is greater for the model with a high Doppler-peak, because the presence 
of this peak leads to a decrease of the correlation angle. 

6 Discussion 

The results of this paper can naturally be separated into two parts. One is the 
presentation of new theoretical results on the theory of clusterisation of peaks in a 
Gaussian random field and on the theory of percolation. All mathematical results are 
analytical. The other is to suggest how such analytical calculations can be applied 
to the analysis of the observational data of the CMB anisotropy. 

6.1 Theory 

The most important new results on the clusterizarion of peaks in a Gaussian random 
filed, in this paper, are the following: 

a. One-dimensional cross-section of the two-dimensional random field 

1. Analytical calculations of the mean length of the cluster of peaks in, one-dimension, 
Eqs. (4.43-4.45). 

2. The shape of the random field in the vicinity of two neighbouring maxima, namely 
the mean value of the field (v(x)) and its dispersion (v(x)) ± yj (v 2 (x)) about this 
mean. These values are to be determined from the Gaussian probability distribution 
of the value of u(x), given that there are two maxima at the points x\ and x 2 of given 
heights and curvatures (second derivatives), Eqs. (4.33, A2.8, A2.9). 

b. Two-dimensional field 

1. Calculation of the densities of saddle points: differential in height N sa ^{y)dv 
,Eq.(4.49), and integral (above the threshold is t ), Eq.(4.50). 

2. The shape around two neighbouring maxima similar to the one- dimensional case, 
that is the expected average value and its dispersion, (Eqs.(B2.5,B2.6)). 

3. Calculation of the mean length of the cluster of peaks in the two-dimensional case, 
Eq.(4.56). 

4. We find, that the level of percolation, u p , and the level of appearance of an 
infinite cluster of peaks are the same: k{y p ) = oo. It is well known, that in the two- 
dimensional case we have v p — (because of the symmetrical properties with respect 
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to the zero). Similar analytical results can be obtained for the 3-dimensional case 
(now it is known only numerically), but is most probably only of theoretical interest 
and is beyond the scope of this paper. 

6.2 Applications to the analysis of the observational data. 

In Section 5 we have presented our suggestions how analytical calculations can be 
used in the analysis of observational data. The proposed topological method can 
provide more information about the observational data than the correlation analysis. 
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8 Appendix Al 

Density of maxima and minima in one-dimension 

The problem of the density of extrema is especially easy in one-dimension, and 
was solved by Rice (1944, 1945). For a review see also J.Bardeen, J.Bond, N. Kaiser 
and A.Szalay (1986, BBKS). 

The differential density of maxima and minima is 

N max (is)dis = 



1 + $ 



7/v 



72(1 -7 2 ). 



dv, {Al.l) 



where <&(#) = ^= Jq e * 2 dt is the probability integral (Gradshteyn and Ryzhik, 1980). 
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The integrated density is: 

r+oo 



J V 



N m ax(v(dis) 



ft 



47T O"! 



+ 7e" 



'2(1 -t 2 ). 

The density of maxima of arbitrary height is: 
N + (-00) = — — 

9 Appendix A2 



1 + $ 



(A1.2) 



(A1.3) 



Shape around two maxima in one-dimension. 

Let us define the event A as the presence of two maxima at the points x\ and x 2 
respectively with parameters: v{xi t2 ) = ^1,2; ^(^1,2) = 0; £(#1,2) — £1,2, an d the event 
z/ with the field at the point x located in the interval from v to v + diA The joint 
probabilities are: 

P(v, B; C)dv{x)dv{x\)...d$ < {x 2 ) = P(u(x), v(xi), ^(x 2 )du(x)di , (xi) . . .d^,(x 2 ) , 

P(B;C)dv( Xl )...dZ(x 2 ) = P(v(x 1 ),...,Z(x 2 )dv(x)di/(x 1 )...dS(x 2 ) , 
1 



P(u,B;C) = 
P(u,B;C) = 



^/(27r) 7 det Mi 
1 

^/(27r) 6 detM 2 



(A2.1) 



where Mi and M 2 are the 7x7 and 6x6 covariance matrices, respectively, Q\ and 
Q2 are the quadratic forms. The conditional probability for v{x) to be in the range 
from u(x) to u(x) + dv{x) is: 

Using (A2.1) and (A2.2) we obtain: 



P{v\B-C)dv{x) = -IJ^e-©-*)^^). 
V ' ; K J v^VdetMi V ; 



(A2.3) 



Now we define the functions 
C(r) 



*(r) 



2 ' 
^0 



ip{r) 



C"(r) _ 

2 ' 

*1 



C /y (r) 



2 ' 
^2 
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S(r) 



C'(r) 



A(r) 



C"\r) 



In terms of these functions the covariance matrix M 1 is 



M 1 = 
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(A2.4) 



(A2.5) 



where functions with subscripts "1", "2" and without subscripts are evaluated at the 



|x - xi , r 2 



x - x 2 |, r 



Fi _ X 2 



points ri, r 2 and r respectively. Here 77 
M 2 can be obtained from M x by excluding the last row and the last column. If we 
introduce the new variables y^ i — 1, 6, and v, so that 



2/1 = V\ ~ V 2 \ 



y 2 = v 1 + v 2 ; 



V3 = Vl-V2 + 



1 + # 



2/2; 



2/4 = 7/1 + 7/2 



1 - * 



2/1; 



, , , 7(1 + 2^) 2A + T ^ 7 (l + 2^) 
2/5 = 6-6 + TTT^ ^2/1 ; —2 2/4; 



2/6 = 6+6 + 



2(1 
7(1- 



2y?) 



2(1 + *) 



1-2^-2^ 
2A + 7^ 

1 + lif - Ij^ 



V = V 



(A2.6) 



then Mi )2 can be written in a diagonal form. In Mi, (y 2 ), i — 1, 6 occupy the first 
six positions, and (z/ 2 ) stands on the seventh position. In matrix M 2 , all six diagonal 
positions are occupied by (yf), i — 1, 6. The difference between the two quadratic 
forms is then, Qi~Q 2 = |^/(^ 2 )- Also det Mi = det M 2 ■ (V 2 ). Eq.(A2.3) has now a 
very simple form: 



P{u\B;C)du(x) 



-.e 2 (» 2 )dv 



(A2.7) 



As it is seen from (A2.6) and (A2.7), v has the mean value: 

z v (m) 



(A2.S) 



24 



and the variance: 

(- 2 ) = 1"E^^- 042.9) 

The values in (A2.8) and (A2.9) are as follows: 
(u yi ) = - * 2 ; (vy 2 ) = + * 2 ; 

Ws) = -5i + 6 2 + ^(^2); 

ri r 2 1 — w 

x - xi x - x 2 5 



« = 7(^1 + + 7^ rT¥y (-y 2 ) + 1 + 2y _ 2 _g_ 



(y 2 ) = 2(l-*); (y 2 ) = 2(l + *); 

(y 2) = 1 + 2^ - 2^; (y 4 2 > = 1 - 2^ - 2^: 

(y2) = 3 _ 2/ _ 7 2 (1 + 2y?) 2 (2A + t^7(1 + 2 V ))\ 



4 2(1-*) 1-2^-2^ 

, 2 y 3 7 2 (1 - 2 V? ) 2 (2A + T ^ 7 (l- 2 v? )) 

^ 6) "4 + ^ 2(1 + *) 1 + 2^-2^ 



2 



'1+* 



In order to construct maxima in xi and x 2 , it is necessary to take into account that 
771,2 = and £i j2 < 0. 



10 Appendix Bl 

In this appendix, we derive the differential and the integrated density of the saddle 
points only, since the density of maxima and minima was obtained by (B.E.). Here 
we present their result and compare it with the result for the saddle points, which 
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can be obtained in a similar fashion. Following (B.E.), the covariance matrix M for 
the values u, r\\ r) 2 , £n, £22 and £ i2 can be written in the following form: 



M 



( 1 








-7/2 


- 7 /2 


\ 





1/2 




















1/2 











-7/2 








3/8 


1/8 





-7/2 








1/8 


3/8 





V 














1/8 j 



(£1.1) 



The joint probability is: 

P{y, Vi, V2, £11, 62, £12) dvdriid^d^nd^d^n = 

2 1 _ 

e w dvdrjidrj 2 d^iid^22di\2 



7T 3 a/1 — 7 2 
,2 



(51.2) 



12- 



n v " 1 2 . 2 . /> e n2 . (£11 + 62 + 7^) 2 . ,, 
<2 = y + ^1 + % + (61 - 62) + 2(1 - 7 2 ) ^ 

The differential density of extrema is: 

N ext {v) = j I det^lP • S( m )5( m ) ■ d m ...d£ 12 . 

We rotate the coordinate system by the angle (p = \arctg |~|^ to align with the 
principal axes of Thus, we get the diagonal form — AiA 2 , ordered by Ai > A 2 , 
and introduce variables Ai, A 2 , (p: 



£11 = Ai cos 2 <p + \ 2 sin 2 <f, 
62 = Ai sin 2 ip + X 2 cos 2 ip, 



(51.3) 



62 = (Ai - A 2 ) sin y? cos </?. 
Thus, the transformation of the volume element is: 
d£iid£ 22 d£ 12 = (Ai - \ 2 )d\id\ 2 d(p. 



(51.4) 



The orientation angle cp is randomly distributed over < y? < it. The differential 
density of extrema is: 



N ext (v)dv = 



47r 2 a/1 - 7 2 o- 2 ./.«/ 



(4+7") 
^ 



e 2 du / e 2(i-7 2 ) ^jy _ x | e 3 rfxrfy. 



(B1.5) was obtained from (B1.1)-(B1.4) by means of substituting: 



Ai = 



x + y 



A 2 = 



y-x 



(51.5) 



(51.6) 
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The limits of the integration over S depends on the type of the extremal points, 
namely 



> Ai > A 2 — maxima 
Ai > 0, A2 < — saddle points 
< A2 < A2 — minima 

If we are interested in the density of the saddle points, then 

N sad (v)dv = 



(B1.7) 



(51.8) 



1 



a. 



2 e ^ dv I xe x ' "dx I e ^o^ 1 ) (x z - y 2 )dy 



X _ (j/+7") 



4tt 2 ^l^al 
After integrating, we get a very simple result: 



N sa d{v)dv = 



1 \o\ 



The integrated density is 



g 2(3-2^)^ . 



/•OO 

n sadist) = / N sad {y)dv 

Jut 



1 o\ 



1 - $ 



^3 



2(3 - 2 7 2 



(SI. 9) 



(51.10) 



where <&(#) = ^= f£ e * 2 dt is the probability integral (Gradshteyn and Ryzhik, 1980). 
The density of the saddle points of arbitrary height is 



n sad (-oc 



1 gj 
47IV3V 2 



(Sl.ll) 



Analogous results were obtained by (B.E.) but for differential and integrated densities 
of the maxima and minima. The differential density of maxima is 



N m ax{v)dv 



1 



y/2 a\ 
Air^a 2 I V3 - 2 7 2 



_ e 2(3-2-y2) 



1 + $ 



71/ 



2 - 2 7 2 )(3 - 2 7 2 



7 (z/ 2 - 1) ^ 



1 + $ 



71/ 



72(1 -7 2 ), 



+ 



7 z>Vl — 7 2 1£_ 



2tt 



Integrated number density must be evaluated numerically, 

n max (yt) = / N max (v)dv. 
The density of maxima of arbitrary height is 

n max { 00, 7; • 



(51.12) 



(51.13) 



(51.14) 
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Note the following important statement: 

n max {v t ) + n min (is t ) - n sad (v t ) = (51.15) 



1 a\ [+™ y 2 , 2 1X 



72(1 -7 2 )/ VV^-T 2 ), 



Using the property <3>(x) = — $(— rr) and (B1.15), we obtain 

tWe(O) + n min (0) - n sad (0) = 0. (51.16) 

11 Appendix B2 

Similar to one-dimension, we define events B, C as the presence of two maxima at 
the points 1 and 2 respectively with parameters: v 1 ; r){] r}\\ - for the first 

maximum and u 2 ; i]f; rfo\ £ 2 : ; £§ 2 ; £ 2 2 for the second one. We introduce the coordinate 
system x, y as following. Let the first maximum is located at the point (0, 0) and the 
second maximum is located at the point r = (r , 0). The conditional probability for 
u(r) to be in the range from u(r) to u(r) + du(r) is 



P(u\B;C)du(r) = -^W $^e- (Ql - Q2 Wr) , (52.1) 
v 1 ; w v^FV detMi v ' y ' 



where M\ and M 2 are the 13 x 13 and 12 x 12 covariance matrices respectively, Q\ 
and Q 2 are the quadratic forms. Elements of these matrices can be written in terms 
of the functions: 

w = — ; d = ; { p = —; { p = — ; a = ; a = 

r"" ~ c" ~ r 
<r 2 ct 2 <r 2 

where 

c = vr y" W(k)J (kr)dk, 
is the correlation function and for ra = 1, 4 its derivatives, 
Ci = -tv j k 3 C(k)J (kr)dk, 

for n = 1, 2 is its derivatives, and 



c 2 = n J k 5 C{k)J {kr)dk. 
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The matrices M 1 and M 2 can be written in a convenient and simple diagonal forms: 

M Uj = {ziZjjSij, i,j, = 1, 13, 

M 2ij = {ziZjjSij, i,j, = 1, 12. 

The difference between two quadratic forms is then Qi — Q 2 = jZi 3 /(z 2 3 ). Also 
det Mi = (z 2 3 ) det M 2 and for conditional probability one can write: 

,2 

1 _ Jf _ 



P(v\B-C)dv{r) = e ^ dv , (£2.3). 

The variable 213 is the linear combination of v = u(x,y) and Zi, % — 1, 13: 

^i3 = -E^ (52.4) 
As it is seen from Eqs.(B2.3) and (B2.4), that v has the mean value: 

and the variance: 

(- 2 ) = E^f- (52.6) 

The variables and (z 2 ) are as follows: 

z x = v l -v 2 ] z\ = 1{\-$)\ z 2 = u 1 + u 2 ; z 2 = 2(1 + ?/>); 
5 ,0, 5 2 



z 3 



5 <5 2 

z 4 = 77} + 77 2 - -z x - (zl) = l-2<p- 2- 

1 — ip 1 — ip 

Z5 = vi~vl, (4) = 1 - 2 (y? - <p)l z 6 = rjj + ril; (z^) = 1 + 2(y? - <p); 
z * , 7(l + 2(^- y )) r 2A + ^t 2( r» ,. 

3 ^(l^-y.)) 2 (2A+ ^gg^ - 
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* = £n + e n + 



<* 2 2 > 



<*§> 



^3; 



3 7 »(l-20-?))' (2A+ ^7?f-") )- 



ti t2 ti t2 2(A — 2A) 

Z 9 = S22 ~~ S22 — Sll + Sll Z4 ~ 



' A f- 2/ - i 2(A - 2A)(2A + ^il±fal) ' 



(4 



<*2><*?> 



< z g) = --2(/-2/ + /) 



7 2 (l + 2(^- v? )) 2 (2(A-A) + ^ 1 t! ( ;- y)) ) ; 



+ 2(/-/)- 



7 2 (l+2(y-y.)) 2 



2A+ 



7<5((l+2(y- ¥ ;)) 
!-</> 



2(A-A)+ ^« 1 {5;-^» 
(if) 



2 ^1 >2 2(A — 2A) 
z w = £22 + £22 ~~ sn — H r^v — ^3- 



2/-4/-J 2(A - 2A)(2A + g^falj 



^8; 



<4 ) > = - + 2(/-2/ + /)- 



7 2 (l-2(^-^)) 2 [2A-(A) 



7g((l-2(y-y)) 



!+2a-/)- ^ (i - ( 2 |-^ 2 



2A+ 



7<5((l-2(y- ¥ >)) 
1+V 



2(I _ A)+2Mi _2(p £ ))j I 



(4 



* -f 1 f 2 2 ^~ A ) ^. 

Z ll — S12 ~~ S12 ^2^ 



(4) = 7 + 2(/ - /) - 



4(A - A) 



z %\ 
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z -f 1 + f 2 2CX - X h - 

Zl2 — ?12 + ?12 T~2\ Zs ' 

\ z bl 



(^ 2 > = ^ + 2(/-/) 



4(A - A) 



For values (iaz*), i = 1, 12 we obtained: 
(i/zi) = - ^2; (^2) = V'i + ^2; 

(uz 3 ) = -—61 + - — —S 2 + - - (1SZ2); 
n r 2 1 + ip 

X X — Tq $ 

(1SZ4) = — Si S 2 - r(^i); 

ri r 2 1 - *0 



(i/z 5 ) = Si H 5 2 ; (iaz 6 ) = Si S 2 ; 

ri r 2 n r 2 



(uz 7 ) 



x 2 -y 2 (x- r ) 2 -y 2 y 2 _ _ 
2 — Vi 2 ^2 + -9-^1 2 ^2+ 



7< 5((l+2(y-<p)) 



7(1 + 2(g-„j) 2A + 



(4) 



X 2 — I/ 2 

(^8> = ~2 ^1 



(4) 

(x - r ) 2 - 



T2 



7 (l-2(y>-yQ) 



(^2) + 



r 2 
'2 



2A + 



y ~ y ~ 

V?2 + ~n(Pl + + 

rf ri 



7 <H(l+2(<p-y)) 



. I/ 2 ~ ^ 2 ,0 y 2 -(x-r ) 2 /o „ 2(A-2A) . 



(4) 



4/-2/-J 2(A-2A)(2A + 7<5((1 t! ( ;- y)) 



l/ 2 -^ 2 /^ ~x y 2 -(z-r ) 2 ~x 2(A-2A). 



r 2 
r 2 



2/-4/-i 2(A-2A)(2A + 7<5((1+2(y ~ y)) 



/ v ~n (^- r o)y, ~x 2(A-A) 
{vzii) = —{2(fi - (pi) - 2 (2^2 - <P2) r^v— <^>l 

r l r 2 \^6 / 

/ v x y to ~ \ , (x-r )y , n _ s 2(A - A) 

(^12) = -2-(2^1 - V?l) + "3 (2<^2 - tfi2) J^— (VZ5); 

'1 '2 \ z 5/ 
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Figure captions 



Fig.l Correlation function for CDM models with Harrison-Zel'dovich spectrum 
of the initial perturbations, fto — 1, h — 0.5. (a) Filtered correlation function with 
filters Hq and H 2 . Dashed line: fi& = 0.1. Dotted line: f2& = 0.03. Solid line: 
models without a Doppler-peak and C t = (b) The same as Fig. la, but for 

filtered correlation functions with filter H*(5 max ,5 m i n ,r). 1: for 5 max ~ I — 10, 
I = 150; 2: for 5 

max r ^ J 

I = 30, 5 

min ~ I — 150. (c) Filtered correlation 
functions for the filter H 2 with the following width 8 (curves from the left to the 
right) :(1) S ~ I = 660, (2) S ~ I = 350, (3) 5 = 0.67° ~ I = 150 corresponding to the 
resonance filter, (4) S ~ I = 100, (5) S ~ Z = 75, (6) 5 ~ Z = 50. 

Fig. 2 Correlation functions for the model fife = 0.03 (ensemble average) shown 
as solid lines and their dispersions C(r) ± \J D{r : ) as dashed lines; realizations are 
shown as *. The field size is 10° x 10°. The upper part of the figure, marked by H Q , 
corresponds to the unfiltered Map la. The lower part, marked by H 2 , corresponds to 
the filtered Map lb. 

Fig. 3 The dependence of the spectral parameter 7 on the value of the filter width 
5. (a) For the filters Hq and H 2 . The solid line is without a Doppler-peak; the 
dotted line corresponds to fij, = 0.03; the dashed line to = 0.1. (b) The same 
as Fig. 3a, but for the filter H*. 1: corresponds to 5 

max ~ Z — 10, 2: corresponds to 

Smax ~ I = 30. 

Fig. 4 (a) A one-dimensional cut of the simulated map 10° x 10°. Arrows indicate 
two maxima, which are tested in Fig.4b. (b) The solid line is an expected value of 
the field in the vicinity of two neighbouring maxima indicated in Fig.4a. The dashed 
lines correspond to expected values ±V variance. 

Fig. 5 Clusterisation of Maxima in one-dimension for different values of 7. (a) 
Differential densities of maxima and minima, solid line for 7 = 0.1, dashed line for 
7 = 0.5, dotted line for 7 = 0.9. (b) the mean length of the clusters: solid line for 
7 = 0.1, dashed line for 7 = 0.5, dotted line for 7 = 0.9. 

Fig. 6 AT/T as a two-dimensional surface in a three-dimensional space. 

Fig. 7 Clusterisation of peaks in the two-dimensional case, (a) Differential density 
of maxima shown as a dotted line; saddle points as a solid line; minima as a dashed 
line, (b) Mean length of clusters of maxima (right-hand side) and minima (left-hand 
side) for different 7: solid line for 7 = 0.1, dashed line for 7 = 0.5, dotted line for 
7 = 0.9. 

Fig. 8 Different slices of the AT/T maps in the vicinity of two maxima, (a) a 
model with Q b = 0.1, (b) a model without a Doppler-peak. 

Map 1 (a) Simulated map of AT/T for a region of 10° x 10° for a model with 
Qb — 0.03. (b) The same as for (a) but with the filter H 2 , 6 = 0.67°. 

Map 2 Simulated map of AT/T for a region of 10° x 10°, for a model with 
Qb = 0.03 and filter H 2 , 5 = 0.67°. *=maxima, X=saddle points; contours correspond 
to the level v t — \. 
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Map 3 Percolation through the black zone {y t > 0). 

Map 4 Simulated maps of 10° x 10° for two different models (see text) (a) O b = 0.1, 
(b) Model without a Doppler-peak. 
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